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ABSTRACT 

In bacteria, promoter identification by RNA polymer- 
ase is mediated by a dissociable a factor. The 
housekeeping a 70 factor of Escherichia coii recog- 
nizes two well characterized DNA sequence 
elements, known as the '-10' and '-35' hexamers. 
These elements are separated by 'spacer' DNA, the 
sequence of which is generally considered unim- 
portant. Here, we use a combination of bioinfor- 
matics, genetics and biochemistry to show that a 70 
can sense the sequence and conformation of the 
promoter spacer region. Our data illustrate how 
alterations in spacer region sequence can increase 
promoter activity. This stimulatory effect requires 
a 70 side chain R451, which is located in close prox- 
imity to the non-template strand at promoter 
position -18. Conversely, R451 is not required to 
mediate transcriptional stimulation by improve- 
ment of the -10 element. Mutation of a 70 residue 
R451, which is highly conserved, results in reduced 
growth rate, consistent with a central role in 
promoter recognition. 

INTRODUCTION 

RNA polymerase requires specific DNA sequences known 
as promoters in order to recognize DNA and initiate the 
transcription of a gene. In bacteria, recognition of 
promoters is mediated by a dissociable subunit of RNA 
polymerase known as the a factor (1). Most bacteria 
encode multiple a factors, with different DNA binding 
specificities, and a factor switching therefore represents 
a simple mechanism via which RNA polymerase can be 
directed to different sets of genes (2). The major a factor 
in Escherichia coli is a 70 , which recognizes two DNA se- 
quences known as the —10 and —35 hexamers (3). The —10 



hexamer ^-TATAAT^) is located ~7-bp upstream of 
the transcription start site and becomes single stranded 
during transcription initiation. The —35 sequence (5'-TT 
GACA-3') is usually located 17-bp upstream of the —10 
element and is not unwound during open-complex forma- 
tion (4). A subclass of 'extended —10' promoters have a 
5 -TG-3' motif at promoter positions —14 and —15, 1 bp 
upstream of the —10 hexamer. This serves to further sta- 
bilize RNAP-DNA interactions (5). The major promoter 
elements were originally identified on the basis of similar- 
ity between small numbers of very efficient promoters (6- 
9). Recent large-scale cataloguing of promoters in E. coli 
now permits hundreds of promoter sequences, many of 
which are sub-optimal, to be aligned. The overall conser- 
vation of different promoter elements can then be ascer- 
tained. Figure 1A shows a DNA sequence logo generated 
from the alignment of 554 E. coli promoters produced by 
Mitchell et al. (10). The —10 sequence is clearly the best 
conserved promoter element. Comparatively, the —35 and 
extended —10 elements are poorly conserved. Indeed, 
some sequences in the 1 7 bp spacer region, generally con- 
sidered to be unimportant, are better conserved than parts 
of the —35 and extended —10 elements. 

Interactions between the promoter elements described 
above and RNA polymerase are mediated by specific a 
factor determinants. Primary a factors, such as a 70 , 
consist of two or more conserved domains connected by 
flexible linkers. Sub-regions of domains 2, 3 and 4 mediate 
sequence specific RNA polymerase-DNA contacts. 
Region 2.4 of a 70 domain 2 contacts the —10 element, 
with side chains Q437 and T440 interacting with the 
DNA at promoter position -12 (11,12). The 5 / -TG-3 / 
motif of extended —10 promoters is recognized by side 
chain E458 in a 70 domain 3 (5). Contact with the —35 
element is mediated by domain 4 with amino acid side 
chains R584, E585 and Q589 making the critical inter- 
actions (13). Contacts between RNA polymerase and the 
spacer DNA were detected biochemically over 30 years 
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Alignment of 554 E. coli promoters 
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Figure 1. Promoter sequence, organization and recognition. (A) The panel shows a DNA sequence logo generated from the alignment of 554 E. coli 
promoters produced by Mitchell et al. (10). Different promoter elements are labelled. (B) A structural model of the RNA polymerase holoenzyme- 
DNA complex is shown (12). With the exception of the two a-subunits, each RNA polymerase component protein is shown in a different colour and 
is labelled. The DNA is shown in green with the base pair at position —18 highlighted in red. The expansion shows the close proximity of the loop 
between a 70 domains 2 and 3 and the promoter non-template strand. Residues in a 70 mutated during the course of this work are highlighted and 
labelled in the expansion. 



ago (14-16). However, the nature or role of these contacts 
has not been probed further. 

It is well established that the length of the 'spacer' DNA 
between the —10 and —35 elements is critical (17). Recent 
work has suggested that the sequence of the spacer 
may also be important (18,19). Structural modelling 
of the RNA polymerase holoenzyme-DNA complex 
places the linker between a 70 domains 2 and 3 within 
2 A of the non-template strand at promoter position 
— 18, just upstream of the extended —10 element (12) 
(Figure IB). Interestingly, as the data in Figure 1A 
show, the DNA sequence immediately upstream of the 
extended —10 element is not random. For example, T is 
the preferred base at both positions —17 and —18. In 
this work, we have investigated the role of the spacer 
region, and the linker between a 70 domains 2 and 3, in 
controlling promoter activity. We show that altering 
the base sequence at promoter position —18 modulates 
transcription initiation at many promoters. Moreover, 
mutational analysis reveals that a 70 side chain R451, 



located in the linker between a domains 2 and 3, is 
required to respond to changes in DNA sequence at 
promoter position —18. Substitution of side chain R451 
with alanine results in decreased growth rate consistent 
with R451 playing an important role. 

MATERIALS AND METHODS 

Strains plasmids and oligonucleotides 

Bacterial strains and plasmids are listed in Table 1. 
Standard techniques for recombinant DNA manipula- 
tions were used throughout. Table 2 lists primers used to 
amplify sections of the cbpA regulatory region in such a 
way that it was flanked by EcoRI and Hindlll restric- 
tion sites. After digestion, fragments carrying cbpA regu- 
latory were cloned into pSR, sequenced and then 
sub-cloned into pRW50. The exception to this was the 
screen for 'up' mutations in the spacer region where frag- 
ments were cloned directly into pRW50. We have 
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Table 1. Bacterial Strains and plasmids 



Name 



Description 



Source 



Bacterial strains 
JCB387 
MC4100 

MC4100r/wS::kan 

T7 express (Invitrogen) 

Plasmids 
pSR 

pRW50 

pRW224 
pVRa 

pET21b (Novagen) 



Anir Alac 

F~ araD\39 A(argF-lac)U 169 rpsL\50 relAX deoCX ptsF25 rbsR flbB530l 
MC4100rpoS::kan 

fhuA2 lacZ::T7 genel [Ion] ompT gal sulAll R(mcr-73::miniTnlO-TQt s )2 [dcmj 
R(zgb-210::TnlO-Tet s ) endAl A(mcrC-mrr) 114::IS10 

pBR322-derived plasmid containing an EcoRI-Hindlll fragment upstream of the Xoop 

transcription terminator 
Low-copy number broad-host-range lac fusion vector for cloning promoters on 

EcoRI-Hindlll fragments: contains the RK2 origin of replication and encodes Tc R 
A derivative of pRW50 

pBR322 derivative encoding rpoD and mutant derivatives 
T7 Expression vector containing 6xHis tag 



(20) 
(21) 
(22) 



(23) 

(24) 

(25) 
(26) 



Table 2. Oligonucleotides 



Name 



Sequence 



Source 



Primers to introduce 
-22N cbp A A45 
-2 IN cbp A A45 
-20N cbp A A45 
-19N cbp A A45 
-18N cbpAA45 
-17N cbp A A45 
-16N cbp A A45 
-15N cbpAA45 
-14N cbpAA45 
-13N cbp A A45 



random single base substitutions into the cbpA regulatory region spacer DNA element 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAANTTTGAGGATTACCC 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATNTTGAGGATTACCCT 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTNTGAGGATTACCCTA 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTNGAGGATTACCCTAC 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTNAGGATTACCCTACA 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTGNGGATTACCCTACAC 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTGANGATTACCCTACACT 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTGAGNATTACCCTACACTT 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTGAGGNTTACCCTACACTTA 3' 
5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTGAGGANTACCCTACACTTAT 3' 



Primers used for site-directed mutagenesis of the cbpA regulatory region 

cbpAA45 -18C 5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTCAGGATTACCC 3' 

cbpAA45-9A -10T 5'GGCTGCGAATTCATATTCTGTGTTGGCATATGAAATTTTGAGGATTATACTACACTTATAGG 3' 

Primers used to amplify cbpA promoter inserts cloned in plasmid pSR 
pSR up 5'GCATTTATCAGGGTTATTGTCTC 3' 

pSR down 5' CATCACCGAAACGCGCGAGG 3' 

Primers used to introduce alanine codons into rpoD 

Hindlll oligo 5'GGGGAAGCT2TTAATCGTCCAGGAAGCTACGCAGCACTTCAGAACGGCTCGGGTGACGC 

AGTTTGCGCAGCGCCTTCGCTTC 3' 
R451A 5'CCTGGTGGATCCGTCAGGCGATCACCCGCTCTATCGCGGATCAGGCGCGCACCATC GCT 

ATTCCGGTGCATATGATTGAGACC 3' 
I452A 5'CCTGGTGGATCCGTCAGGCGATCACCCGCTCTATCGCGGATCAGGCGCGCACCATCCGTGCT 

CCGGTGCATATGATTGAGACC 3 ' 
P453A 5'CCTGGTGGATCCGTCAGGCGATCACCCGCTCTATCGCGGATCAGGCGCGCACCATCCGTATTGCG 

GTGC AT ATG ATTGAGAC C 3' 

V454A 5'CCTGGTGGATCCGTCAGGCGATCACCCGCTCTATCGCGGATCAGGCGCGCACCATCCGTATTCCGGCG 

CATATGATTGAGACC 3 ' 

H455A 5'CCTGGTGGATCCGTCAGGCGATCACCCGCTCTATCGCGGATCAGGCGCGCACCATCCGT 

ATTCCGGTG GCT ATGATTGAGACC 3' 

Primer used to amplify the rpoD RA451 allele for cloning into pET21b 

RpoD pET21b Up 5'AGCTCAGCTAGCGAGCAAAACCCGCAGTCACAGCTGAAAC 3' 

Primers used to amplify the LEE1 promoter and derivatives 

LEE up 5'GAATTCTTGACATTTAATGATAATGTATTTTACACATTAGAAAAAAG 3' 

LEE up -18A 5 ' G AAT T C T TG AC AT T T AATG AT AAT AT AT T T T AC AC ATT AG AAAAAAG 3' 

LEE up -18C 5 ' G AAT T C T TG AC AT T T AATG AT AATC T AT T T T AC AC ATT AG AAAAAAG 3' 

LEE up -18T 5 ' G AAT T C T TG AC AT T T AATG AT AATT T AT T T T AC AC ATT AG AAAAAAG 3' 

LEE down 5 / AAGCTTATTCTCTTTTTTCTAATGTGTAAAA 3 ' 



This work 
This work 
This work 
This work 
This work 
This work 
This work 
This work 
This work 
This work 

This work 
This work 

This work 
This work 

This work 
This work 
This work 
This work 
This work 
This work 

This work 

This work 
This work 
This work 
This work 
This work 



Restriction sites used for cloning are shown in bold typeface and are italicized. Mutations introduced during the PCR are underlined. 
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numbered mutations in the cbpA regulatory region with 
respect to the Ea 70 transcription start point (+1) and with 
upstream and downstream locations denoted by '— ' and 
'+' prefixes, respectively. 

Proteins 

Core E. coli RNA polymerase was purchased from 
Epicenter (Madison). To overproduce the a 70 RA451 
protein the rpoD RA451 allele was amplified by PCR 
from plasmid pVRa and cloned into pET21b. 
Preparations of all afactors and derivatives were made 
by overexpression of the cloned rpoD and rpoS alelles in 
T7 express cells (NEB). Inclusion bodies were then 
solublized in 6 M Guanidine HC1, before being dialysed 
into buffer containing 20 mM Tris, lOOmM NaCl and 
10% glycerol. Proteins were bound to a HiTrap QFF 
anion exchange column (Pharmacia) and eluted with a 
linear gradient to 1 M NaCl. RNA polymerase holoen- 
zyme was reconstituted by incubating core RNA polymer- 
ase with equimolar amounts of a 0 and a 38 at room 
temperature for 20 min. 

KMn0 4 footprinting 

Purified Aatll-Hindlll DNAfragments were derived from 
maxi preparations (using a Qiagen maxiprep kit) of 
plasmid pSR carrying the cbpA regulatory region or de- 
rivatives. Fragments were labelled at the Hindlll-end 
using [y- 32 P]-ATP and polynucleotide kinase. Footprints 
were performed at 37°C as in our previous work (27). 
DNA fragments were used at a final concentration of 
10-40nM in buffer containing 20 mM Tris pH 7, lOmM 
MgCl 2 , lOOmM EDTA and 120mM KC1. Footprints 
were analysed on a 6% DNA sequencing gel (Molecular 
Dynamics). The results of all footprints and EMS A ex- 
periments were visualized by exposing the dried gel 
against a Fuji phosphor screen and analysed using a 
phosphorimager and Quantity One software. 

In vitro transcription assays 

The in vitro transcription experiments were performed 
as described previously (28) using the system of Kolb 
et al. (23). A Qiagen maxiprep kit was used to purify 
supercoiled pSR plasmid carrying the cbpA regulatory 
region or derivatives. This template (16ug/ml) was 
incubated in buffer containing 20 mM Tris pH 7.9, 
5mM MgCl 2 , 500 uM DTT, 50 mM KC1, lOOug/ml 
BSA, 200 uM ATP, 200 uM GTP, 200 uM CTP, 10 uM 
UTP with 5 uCi [(oc- 32 P]-UTP. The reaction was started 
by adding purified E. coli Ea 70 and/or Ea 38 . Labelled 
RNA products were analysed on a denaturing 
polyacrylamide gel. 

P-Galactosidase assays 

DNA fragments containing the cbpA regulatory region or 
the locus for enterocyte effacement 1 (LEE1) promoter 
were cloned into pRW50 or pRW224 respectively to gen- 
erate promoter: \lacL fusions. P-Galactosidase levels in 
cells carrying these recombinants were measured by the 
Miller (29) method. Activities are the average of three 



or more independent experiments. Cells were grown 
aerobically in LB media as described. 

DNA bending assays 

DNA fragments generated by PCR were separated by 
electrophoresis on a 7.5% polyacrylamide non-denaturing 
gel. Electrophoresis was performed at 4°C in TBE buffer. 
After electrophoresis, the gels were stained with ethidium 
bromide and DNA was visualized by UV illumination. 
The DNA fragments for this analysis were generated by 
PCR. Thus, the cbpA regulatory region cloned in plasmid 
pSR was amplified using pSR up and pSR down oligos 
(Table 2). The LEE1 promoter and derivatives were 
generated using the oligonucleotides shown in Table 2. 
Note that the upstream and downstream LEE oligos 
overlap and can thus be used to generate a double- 
stranded DNA product without template DNA. 

Modelling of DNA fragments in silico 

Changes in DNA bending were modelled computationally 
using the 'model.it' web server (http://hydra.icgeb.trieste 
.it/dna/model_it.html) using the default parameters (30). 
Predicted DNA structures were downloaded in pdb 
format and PyMOL was used to prepare figures. 

RESULTS 

An E. coli gene regulatory region with overlapping 
promoters 

The aim of this study was to investigate the role of the 
RNA polymerase a 7 subunit in sensing the sequence of 
the promoter spacer. To facilitate this, we chose to work 
with a regulatory DNA region containing overlapping 
promoters, with a shared spacer region, but different a 
factor specificity. Our logic was that DNA sequences in 
the spacer region that make selective interactions with a 70 
should affect only one of the two promoters. Conversely, 
sequences that non-selectively stimulate transcription 
should affect both promoters. Thus, the E. coli cbpA regu- 
latory region contains overlapping promoters for RNA 
polymerase associated with a 70 (Ea 70 ) or a 38 (Ea 38 ) 
(Figure 2A) (31,32). Note that, throughout this work, 
we use the Ea 70 transcription start site as the point of 
reference for numbering mutations in the cbpA regulatory 
region. 

To confirm that the two cbpA promoters were truly 
specific for their cognate a factor we performed KMn0 4 
footprinting, which detects open complex formation by 
RNA polymerase. As expected, Ea 0 and Ea 38 pro- 
duce different patterns of DNA opening that are offset 
(Figure 2B). We also found that Ea 70 had to be added 
at higher concentrations than Ea 38 , suggesting that the 
Ea 70 dependent promoter is less efficient. To further 
confirm the a factor specificity of the two promoters, 
we utilized an in vitro transcription assay. Thus, an 
EcoRI-Hindlll fragment carrying the cbpA regulatory 
region (illustrated in Figure 1A) was cloned into 
plasmid pSR. This places the two promoters upstream 
of the factor-independent Xoop transcription terminator. 
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Table 3. Spacer DNA mutations that increase the activity of the 
cbpA regulatory region 



Mutation 


No. of isolates 


(3-Galactosidase activity 


WT 


N/A 


95 


-14G 


1 


127 


-15T 


2 


126 


-17C 


1 


158 


-17T 


4 


199 


-18T 


3 


249 


-18A 


1 


221 



The table shows P-galactosidase activities obtained from overnight 
cultures of JCB387 cells carrying different cbpA::lacZ fusions in 
plasmid pRW50. Mutations are numbered with respect to the a 70 de- 
pendent cbpA transcription start site (Figure 1A). 
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Figure 2. Overlapping promoters in the cbpA regulatory region. 
(A) Sequence of the cbpA regulatory region. Promoters for Ea 70 (red) 
and Ea 38 (blue) are highlighted. The transcription start sites are 
labelled with arrows and designated as Note that, throughout 

this work, we have numbered all mutations in the cbpA regulatory 
region with respect to the Ea 70 (B) Open complex formation by 
Ea 38 and Ea 70 . The panel shows the results of a KMn0 4 footprinting 
experiment designed to detect open complex formation by RNA poly- 
merase at the cbpA regulatory region. The positions of DNA opening 
by Ea 38 and Ea 70 are highlighted in blue and red, respectively and the 
transcription start sites are shown by arrows. Ea 70 was added in 3-fold 
excess of Ea 38 . (C) In vitro transcription by Ea 38 and Ea 70 . The results 
of an in vitro transcription assay are shown. The lower band corres- 
ponds to the RNAI transcript, which acts as an internal control. The 
upper bands are transcripts that initiate from the cbpA regulatory 
region and are due to transcription by either Ea 38 (blue) or Ea 70 
(red). Ea 70 and Ea 38 were added in equal amounts. 



Transcription in vitro with either Ea 38 or Ea 70 produces 
an RNA product, which can be detected after electrophor- 
esis. As expected, while Ea 70 functions less efficiently 
than Ea 38 , it produces a longer transcript (Figure 2C). 
Note that, the plasmid pSR replication origin encodes 
the 108 base RNAI transcript that acts as an internal 
control. 

Isolation and analysis of spacer region mutations that 
stimulate transcription 

To investigate the role of the promoter spacer, we created 
a library of DNA fragments carrying the cbpA regulatory 
region. The library was prepared so that only random 
single base mutations, introduced into the portion of the 



cbpA spacer region shared by Ea 70 and Ea 38 , were 
selected. Multiple base changes were not permitted in 
order to exclude spacer regions with completely altered 
properties (e.g. very A:T rich spacer sequences). The 
library of DNA fragments that we prepared was then 
cloned upstream of lacZ in the plasmid pRW50 to 
create a library of lacZ fusion plasmids. This library was 
used to transform Lac - JCB387 cells and transformants 
were plated on MacConkey agar medium. Note that, 
when fused to lacZ in plasmid pRW50, the wild type 
cbpA regulatory region stimulates only low levels of lacZ 
expression. Thus, JCB387 cells transformed with this 
plasmid construct have a Lac - phenotype (i.e. appear 
white) on MacConkey indicator plates. This was also 
true for the majority of the 216 regulatory region deriva- 
tives screened from our library. However, 12 regulatory 
region derivatives resulted in a Lac + (red) phenotype. 
Colonies with a Lac + phenotype were purified by 
restreaking, levels of lacZ expression were quantified 
using P-galactosidase assays, and finally the sequence of 
the cbpA regulatory region insert was determined. The 
results of this analysis are displayed in Table 3. The data 
show that introduction of a T at promoter positions —18 
and —17 (with respect to the Ea 70 transcription start) had 
the biggest stimulatory effect on transcription and were 
most frequently isolated. The — 18A and — 17C mutations 
also stimulated transcription, but to a lesser extent. 
Interestingly, mutations that optimized the extended —10 
element (— 14G and — 15T) had the smallest stimulatory 
effect. Note that, our findings are largely consistent with 
the alignments presented in Figure 1A. 

Mutations at position —18 stimulate transcription by Ea 70 

Of all the spacer region derivatives that we identified, the 
substitutions at position —18 had the biggest stimulatory 
effect on transcription. Thus, we next sought to determine 
whether the base sequence at position —18 affected both 
of the overlapping cbpA promoters or specifically one of 
the two promoters. To do this, the EcoRI-Hindlll frag- 
ments containing the wild type, — 18T, or — 18A deriva- 
tives of the cbpA regulatory region were cloned into 
plasmid pSR. We also used site directed mutagenesis to 
make a — 18C derivative of this construct. Transcription 
was then measured in vitro with either purified Ea °, or 
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Figure 3. Mutations in the cbpA spacer region have differential effects 
on transcription by Eg 70 and Eg 38 . (A) Effect of mutations at position 
— 18 on transcription by Eg 38 and Eg 70 in vitro. The gel shows tran- 
scripts produced in vitro by Eg 38 and Eg 70 from the cbpA regulatory 
region and derivatives. (B) Activity of cbpA regulatory region deriva- 
tives in vivo. The bar chart shows P-galactosidase expression driven by 
different cbpA regulatory region derivatives, cloned in plasmid pRW50, 
in MC4100 and the rpoS::kan derivative. Values for 'no promoter' were 
obtained using pRW50 carrying no promoter insert. Assays were done 
using overnight cultures. 



Ea 38 , to test the specificity of the mutations. The results 
(Figure 3 A) show that all of the mutations at position —18 
stimulate transcription by Ea 70 (compare lanes 2,4,6 
and 8) while not stimulating transcription by Ea 38 
(compare lanes 1, 3, 5 and 7). To confirm our observa- 
tions, the various promoter derivatives, cloned into the 
lacZ expression vector pRW50, were used to transform 
MC4100 or MC4\00rpoS::kan cells. As expected, 
P-galactosidase expression driven by the wild-type cbpA 
regulatory region is almost totally dependent on Ea 38 . 
Conversely, all of the regulatory region derivatives with 
mutations at position —18 showed an increased depend- 
ence on Ea 70 (Figure 3B). 

To confirm that base changes at position —18 did not 
create an artificial promoter, we checked the positioning 
of open complexes formed by RNA polymerase using 
KMn0 4 footprinting. The data show that the — 18T substi- 
tution stimulates open complex formation (Supplementary 
Figure SI) and that the open complex is not repositioned 
(compare Figure 2B and Supplementary Figure SI). 



Effect of spacer region mutations on DNA bending 

DNA fragments of equivalent length, but with different 
base sequences, can have different migration properties in 
native PAGE. These differences are due to changes in in- 
trinsic DNA curvature. Thus, DNA fragments carrying 
the wild type, -18T, -18A and -18C, derivatives of the 
cbpA regulatory region were subjected to native PAGE 
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Figure 4. Changes in DNA conformation induced by mutations in the 
cbpA spacer region. (A) Derivatives of the cbpA regulatory region, with 
different sequences at position —18, have different mobility on a 7.5% 
native acrylamide gel. (B) Predicted topology of the wild-type (green), 
-18C (blue), -18A (pink) and -18T (yellow) derivatives of the cbpA 
regulatory region. Position —18 is highlighted by an arrow. 



analysis. The results confirm that these fragments have 
different mobility, consistent with altered DNA bending 
(Figure 4A). The wild- type DNA fragment was least 
mobile while the — 18T derivative was most mobile. The 
— 18A and — 18C fragments had an intermediate mobility. 
Consistent with this, computational modelling of DNA 
topology for the different sequences predicted changes in 
conformation of the double helix centred around the —18 
position (Figure 4B). 

A a 70 side chain R451 is required to sense changes in 
spacer region sequence and conformation 

Our data show that altering the sequence of the a 70 -de- 
pendent cbpA promoter at position —18 alters intrinsic 
promoter activity. This is intriguing since the linker 
between domains 2 and 3 of a 0 is within 2 A of the 
DNA at promoter position —18 (12) (Figure IB). 
Moreover, Fenton et al. (33) previously suggested that 
a 70 side chain R451, located in this linker, contacts the 
DNA upstream of the —10 element in the spacer region. 
Thus, we reasoned that changes in promoter conformation 
at position —18 might affect this a 70 -DNA contact. To 
test this, we introduced alanine substitutions into the rpoD 
gene, encoded by plasmid pVRa, at the positions high- 
lighted in Figure IB. The starting pVRa plasmid and de- 
rivatives were then used to transform MC4100r/?6>*S::kan 
cells carrying the — 18T version of the cbpA regulatory 
region in plasmid pRW50. Most of the substitutions in 
rpoD had no effect (data not shown) but the RA451 sub- 
stitution drastically reduced the activity of the — 18T 
promoter [Figure 5A(i)]. We could not test the effect of 
the RA451 mutation on transcription from the wild type 
cbpA regulatory region; Ea 70 drives hardly any LacZ 
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B i cbpA -18T jj cbpA -9A-10T 




Figure 5. Stimulatory effects of the — 18T substitution requires a 70 side chain R451. (A) The bar chart shows |3-galactosidase expression driven by 
the — 18T (i) and — 9A— 10T (ii) derivatives of the cbpA regulatory region, cloned in plasmid pRW50, in MC4100r/>oS::kan cells carrying either pVRa 
or pVRa RA451. Values for 'no promoter' were obtained using pRW50 carrying no promoter insert. Assays were done using overnight cultures. 
(B) The gels show transcripts produced in vitro from the — 18T (i) and — 9A — 10T (ii) derivatives of the cbpA regulatory region by purified Eg 70 and 
Eg 70 RA451. RNA polymerase was added at a concentration of 80, 160 or 240 nM. 



expression from this promoter (compare 'no promoter' 
and 6 wt' in Figure 3B). Thus, as a control, we instead 
used a derivative of the cbpA regulatory region with an 
improved —10 element for Ea 70 (due to the substitutions 
-9A and -10T). As expected, the data show that RA451 
Ea 70 is functional at the — 9A— 10T derivative of the cbpA 
regulatory region [(Figure 5A(ii)]. To confirm our obser- 
vations we also measured transcription in vitro with 
purified Ea 70 or the RA451 derivative. The data confirm 
that RA451 Ea 70 is defective at the -18T promoter but 
not the — 9A— 10T promoter (Figure 5B). 

Role of promoter position —18 and a 70 side chain R451 at 
the LEE1 promoter 

We next sought to determine whether similar phenomena 
could be reproduced at different promoters. The E. coli 
0157 LEE1 promoter drives expression of genes in the 
locus for enterocyte effacement (LEE). A maximal level 
of transcription from the LEE1 promoter requires the 
GrlA transcriptional activator but substantial basal 
levels of transcription are observed in E. coli K-12 cells 
that do not encode grlA. Recently, Islam et al. (25) 
isolated a LEE1 promoter derivative with increased 
GrlA-independent activity due to a G to A substitution 
at promoter position —18 (Figure 6A). Thus, we 
investigated the possibility that a 70 side chain R451 
might be important for mediating this effect. The data 
show that, with the starting LEE1 promoter, RA451 



Ea was not defective. In contrast, while the — 18A 
LEE1 promoter had increased activity, this increase was 
lost with RA451 Ea 70 (Figure 6B). Computational (Figure 
6Ci) and native PAGE analysis (Figure 6Cii) confirm that 
base substitutions at position —18 of the LEE1 promoter 
alter DNA bending. 

A a 70 side chain R451 is conserved and required for 
optimal rates of cell division 

Alignment of primary a factors from different bacteria 
reveals that R451 is highly conserved, consistent with it 
playing a key role in RNA polymerase function (13). We 
found that E. coli cells transformed with pVRa, carry- 
ing the RA451 rpoD allele, displayed a growth defect 
(Figure 7). Thus, a 70 RA451 must be able to compete 
with wt a 70 (chromosomally encoded) for binding to 
core RNA polymerase. Once bound to RNA polymerase 
defects in DNA binding, presumably result in disrupted 
patterns of gene expression and thus impede growth. 



DISCUSSION 

Bacterial promoters have been the subject of intense in- 
vestigation for decades. Recently, several studies have 
focused on the role of the promoter spacer sequence in 
controlling transcription. Thus, Liu et al. (18) showed 
that an 8 bp sequence upstream of the —10 element 
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-35 element -18 -10 element 

LEE1 T GTTGACA T TTAATGATAATGTATTT TACACA T TAGA 

LEE1 -18A TGTTGACAT T T AAT G AT AAT AT AT T T T ACACAT TAGA 




WT -18A 

LEE1 r.lacZ fusion 



C 



WT 




Figure 6. The LEE1 promoter responds to changes in the spacer region 
at position —18. (A) Sequence of the LEE1 promoter and — 18A de- 
rivative. The —10 hexamer, —35 element and promoter position —18 
are highlighted. (B) The bar chart shows |3-galactosidase expression 
driven by the wild type and — 18A derivatives of the LEE1 regulatory 
region in JCB387 cells transformed with either pVRa or pVRa RA451. 
Measurements were taken in mid-log phase using the LEE20-203 pro- 
moter: \lac fusion described by Islam et al. (25). (C) (i) Predicted 
topology of the wild type (green), -18C (blue), — 18A (pink) and 

— 18T (yellow) derivatives of the LEE1 regulatory region. Position 

— 18 is highlighted by an arrow, (ii) Derivatives of the LEE1 regulatory 
region, with different sequences at position —18, have different mobility 
on a 7.5% native acrylamide gel. 



could stimulate transcription from the lac promoter. 
Similarly, Hook-Barnard and Hinton (19) showed that 
region 1.1 of a 70 was important for mediating the effects 
of changes in spacer region sequence. Here, we show that 
there is a clear T>A>C>G preference at promoter 
position —18, on the basis of promoter sequence align- 
ments (Figure 1A) and biochemical/genetic experiments 
with three different E. coli promoters (Table 3, Figure 6, 
Supplementary Figure S2). Scrutiny of the literature 
reveals many other consistent observations. For 
example, Busby et al. (34) showed that a G>A substitu- 
tion at position —18 stimulated transcription from a 
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Figure 7. Mutation of a 70 side chain R451 induces growth defects. The 
graph shows growth curves for cultures of JCB387 cells transformed 
with either pVRa (solid line) or pVRa RA451 (dashed line). Cells were 
grown in LB medium with vigorous aeration at 37° C. The experiment 
was repeated three times and error bars show the standard deviation of 
the recorded OD 650 values. 



modified gal P2 promoter. At the gap Al promoter a 
T>G substitution at position —18 reduces promoter 
activity 3-fold both in vivo and in vitro (35). The 8 bp 
motif identified by Liu et al. (18) has a T at position 

— 18 that is essential for activity. Furthermore, Mitchell 
et al. (10) showed a preference for 5'-TTT-y trinucleotide 
sequences to be centred at position —18. Similar observa- 
tions have also been made at the rrnB PI promoter (36) 
and the dps promoter (Supplementary Figure S2). We note 
that the contribution of the base sequence at promoter 
position —18 may be dependent on overall promoter 
strength, conformation, and could also be influenced by 
transcriptional regulatory proteins. Thus, while it is 
unlikely that the base sequence at promoter position 

— 18 plays a central role at all promoters, there are 
clearly numerous instances where it is important. 

Structural and biochemical studies show that the loop 
between domains 2 and 3 of a 70 is correctly positioned to 
contact position —18 of the promoter non-template strand 
(12,37). Siebenlist and Gilbert (15) reported 'extensive' 
RNA polymerase contacts with the DNA in this region 
of the phage T7 A3 promoter. Similar observations have 
been made at the lacUY5 and the A.cl7 promoters (14,16). 
Side chain R451 of a 70 is proximal to the DNA backbone 
at position —18 and Fenton et al. (33) showed that an 
RS451 substitution abolished the promoter DNA 
binding activity of RNA polymerase in vitro. It was 
concluded that the R451-DNA interaction was non- 
specific. Our data do not exclude the possibility of a 
non-specific interaction with the DNA backbone. 
Indeed, we suggest that changes in DNA conformation 
centred at position —18 (Figure 4) can modulate the 
R451-DNA backbone contact, giving rise to pseudo 
sequence specificity. The extreme deleterious effect of the 
RS451 substitution constructed by Fenton et al. (33) may 
result from introduction of a polar serine side chain 
adjacent to the DNA backbone. Consistent with this, 
a 70 subunits with the RA451 substitution are still func- 
tional (Figures 5 and 6). In summary, this work describes 
a mechanism via which RNA polymerase can sense 
changes in the sequence and conformation of the 
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promoter spacer region. We note that our data also show 
how the sequence of the spacer region can play a key role 
in allowing RNA polymerase to differentiate between 
overlapping promoters (Figure 3). Intriguingly, the muta- 
tions that we have characterized at position — 1 8 are only 
one-half a helical turn away from sites of DNasel and 
singlet oxygen hypersensitivity that occur at —23 and 
—24 in open complexes (38). 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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